Overview

Dataset statistics

Number of variables22
Number of observations8221556
Missing cells4
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 GiB
Average record size in memory284.0 B

Variable types

NUM12
BOOL8
CAT2

Reproduction

Analysis started2020-04-22 15:07:07.288047
Analysis finished2020-04-22 15:51:21.637567
Versionpandas-profiling v2.6.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
FLIGHT has a high cardinality: 1352 distinct values High cardinality
PAR_AC_2 is highly correlated with PAR_AC_1High Correlation
PAR_AC_1 is highly correlated with PAR_AC_2High Correlation
PAR_AC_4 is highly correlated with PAR_AC_3High Correlation
PAR_AC_3 is highly correlated with PAR_AC_4High Correlation
PAR_SYS_10 is highly correlated with PAR_SYS_9High Correlation
PAR_SYS_9 is highly correlated with PAR_SYS_10High Correlation
PAR_SYS_6 is highly correlated with PAR_SYS_5High Correlation
PAR_SYS_5 is highly correlated with PAR_SYS_6High Correlation
TIME is highly skewed (γ1 = 697.5309992) Skewed
PAR_AC_1 has 253549 (3.1%) zeros Zeros
PAR_AC_2 has 260751 (3.2%) zeros Zeros
PAR_SYS_9 has 4339222 (52.8%) zeros Zeros
PAR_SYS_10 has 4330986 (52.7%) zeros Zeros
PAR_SYS_7 has 259639 (3.2%) zeros Zeros
PAR_SYS_8 has 159703 (1.9%) zeros Zeros

Variables

AC
Categorical

Distinct count11
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size62.7 MiB
AC21
1551232
AC23
1461359
AC22
1225095
AC19
961691
AC20
693698
Other values (6)
2328481
ValueCountFrequency (%) 
AC21 1551232 18.9%
 
AC23 1461359 17.8%
 
AC22 1225095 14.9%
 
AC19 961691 11.7%
 
AC20 693698 8.4%
 
AC30 509333 6.2%
 
AC24 507650 6.2%
 
AC35 451742 5.5%
 
AC36 370805 4.5%
 
AC32 283229 3.4%
 

Length

Max length4
Mean length4
Min length4
ValueCountFrequency (%) 
Decimal_Number 8 80.0%
 
Uppercase_Letter 2 20.0%
 
ValueCountFrequency (%) 
Common 8 80.0%
 
Latin 2 20.0%
 
ValueCountFrequency (%) 
ASCII 10 100.0%
 

FLIGHT
Categorical

HIGH CARDINALITY
Distinct count1352
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size62.7 MiB
78882d
 
15925
b17ec6
 
15545
f420bf
 
15149
682bcd
 
14905
6a86c9
 
14781
Other values (1347)
8145251
ValueCountFrequency (%) 
78882d 15925 0.2%
 
b17ec6 15545 0.2%
 
f420bf 15149 0.2%
 
682bcd 14905 0.2%
 
6a86c9 14781 0.2%
 
3e126d 14765 0.2%
 
7dff94 14677 0.2%
 
d5cb8b 14665 0.2%
 
fcfd38 14625 0.2%
 
7a0652 14361 0.2%
 
Other values (1342) 8072158 98.2%
 

Length

Max length6
Mean length6
Min length6
ValueCountFrequency (%) 
Decimal_Number 10 62.5%
 
Lowercase_Letter 6 37.5%
 
ValueCountFrequency (%) 
Common 10 62.5%
 
Latin 6 37.5%
 
ValueCountFrequency (%) 
ASCII 16 100.0%
 

TIME
Real number (ℝ≥0)

SKEWED
Distinct count47624
Unique (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7378.576313
Minimum0
Maximum31536002
Zeros1356
Zeros (%)< 0.1%
Memory size62.7 MiB

Quantile statistics

Minimum0
5-th percentile1035
Q13056
median4783
Q36971
95-th percentile12084
Maximum31536002
Range31536002
Interquartile range (IQR)3915

Descriptive statistics

Standard deviation17635.89212
Coefficient of variation (CV)2.390148366
Kurtosis1242452.129
Mean7378.576313
Median Absolute Deviation (MAD)5296.622469
Skewness697.5309992
Sum6.066337836e+10
Variance311024690.9
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000000e+00 5.0000000e-01 1.5000000e+00 2.5000000e+00 3.5000000e+00 ... 9.2972500e+04 9.4671500e+04 9.5268500e+04 1.0008750e+05 3.1536002e+07], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1356 < 0.1%
 
3599 1293 < 0.1%
 
3609 1259 < 0.1%
 
3597 1259 < 0.1%
 
3598 1259 < 0.1%
 
3591 1258 < 0.1%
 
3593 1258 < 0.1%
 
3611 1258 < 0.1%
 
3590 1257 < 0.1%
 
3595 1257 < 0.1%
 
Other values (47614) 8208842 99.8%
 
ValueCountFrequency (%) 
0 1356 < 0.1%
 
1 908 < 0.1%
 
2 572 < 0.1%
 
3 229 < 0.1%
 
4 142 < 0.1%
 
ValueCountFrequency (%) 
31536002 1 < 0.1%
 
100088 1 < 0.1%
 
100087 1 < 0.1%
 
100086 1 < 0.1%
 
100085 1 < 0.1%
 

AMBIENT_1
Real number (ℝ)

Distinct count81239
Unique (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34542.41204
Minimum-831
Maximum71392
Zeros0
Zeros (%)0.0%
Memory size62.7 MiB

Quantile statistics

Minimum-831
5-th percentile11922
Q117975.5
median40569
Q348055
95-th percentile50107.5
Maximum71392
Range72223
Interquartile range (IQR)30079.5

Descriptive statistics

Standard deviation15019.20515
Coefficient of variation (CV)0.4348047591
Kurtosis-1.516863706
Mean34542.41204
Median Absolute Deviation (MAD)13798.68256
Skewness-0.4103348137
Sum2.83992375e+11
Variance225576523.2
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ -831. 5520.5 8655.5 11455.5 11463.75 ... 53002.75 53004.75 53006.25 53011. 71392. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
47998 71922 0.9%
 
47999 42649 0.5%
 
47999.5 41722 0.5%
 
47998.5 41640 0.5%
 
48000 39170 0.5%
 
48000.5 34937 0.4%
 
49999 33504 0.4%
 
49999.5 32681 0.4%
 
49998.5 31726 0.4%
 
50000 31512 0.4%
 
Other values (81229) 7820093 95.1%
 
ValueCountFrequency (%) 
-831 2 < 0.1%
 
3905 2 < 0.1%
 
4272 1 < 0.1%
 
5185 2 < 0.1%
 
5856 29 < 0.1%
 
ValueCountFrequency (%) 
71392 5 < 0.1%
 
53011.5 1 < 0.1%
 
53010.5 1 < 0.1%
 
53010 3 < 0.1%
 
53009.5 5 < 0.1%
 

PAR_AC_1
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count699
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean91.0096417
Minimum0
Maximum129.63625
Zeros253549
Zeros (%)3.1%
Memory size62.7 MiB

Quantile statistics

Minimum0
5-th percentile29.455
Q156.17
median115.4225
Q3119.19
95-th percentile126.89625
Maximum129.63625
Range129.63625
Interquartile range (IQR)63.02

Descriptive statistics

Standard deviation38.11192096
Coefficient of variation (CV)0.4187679486
Kurtosis-0.8039133559
Mean91.0096417
Median Absolute Deviation (MAD)34.37121645
Skewness-0.8230965291
Sum748240865.7
Variance1452.518519
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 5.1375 10.360625 11.388125 12.758125 ... 129.036875 129.208125 129.379375 129.550625 129.63625 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 253549 3.1%
 
117.4775 105138 1.3%
 
117.64875 104739 1.3%
 
117.82 104729 1.3%
 
117.30625 104457 1.3%
 
117.99125 104267 1.3%
 
117.135 104042 1.3%
 
118.1625 103139 1.3%
 
116.96375 102984 1.3%
 
116.7925 99819 1.2%
 
Other values (689) 7034693 85.6%
 
ValueCountFrequency (%) 
0 253549 3.1%
 
10.275 399 < 0.1%
 
10.44625 568 < 0.1%
 
10.6175 599 < 0.1%
 
10.78875 571 < 0.1%
 
ValueCountFrequency (%) 
129.63625 4 < 0.1%
 
129.465 73 < 0.1%
 
129.29375 1141 < 0.1%
 
129.1225 9243 0.1%
 
128.95125 28492 0.3%
 

PAR_AC_2
Real number (ℝ)

HIGH CORRELATION
ZEROS
Distinct count705
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean90.90196443
Minimum-276.74
Maximum262.355
Zeros260751
Zeros (%)3.2%
Memory size62.7 MiB

Quantile statistics

Minimum-276.74
5-th percentile29.28375
Q156.17
median115.4225
Q3119.19
95-th percentile126.89625
Maximum262.355
Range539.095
Interquartile range (IQR)63.02

Descriptive statistics

Standard deviation38.27212114
Coefficient of variation (CV)0.4210263374
Kurtosis-0.800836377
Mean90.90196443
Median Absolute Deviation (MAD)34.4994654
Skewness-0.8251029976
Sum747355591.1
Variance1464.755257
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-2.76740000e+02 -6.85000000e+00 1.71250000e-01 5.30875000e+00 1.03606250e+01 ... 1.29208125e+02 1.29379375e+02 1.29550625e+02 1.29721875e+02 2.62355000e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 260751 3.2%
 
117.82 105037 1.3%
 
117.4775 104843 1.3%
 
117.30625 104556 1.3%
 
117.64875 104546 1.3%
 
117.135 104266 1.3%
 
117.99125 104182 1.3%
 
118.1625 103233 1.3%
 
116.96375 102560 1.2%
 
116.7925 100138 1.2%
 
Other values (695) 7027444 85.5%
 
ValueCountFrequency (%) 
-276.74 2 < 0.1%
 
-57.54 2 < 0.1%
 
-13.7 2 < 0.1%
 
0 260751 3.2%
 
0.3425 3 < 0.1%
 
ValueCountFrequency (%) 
262.355 1 < 0.1%
 
129.8075 1 < 0.1%
 
129.63625 2 < 0.1%
 
129.465 38 < 0.1%
 
129.29375 1044 < 0.1%
 

PAR_AC_3
Boolean

HIGH CORRELATION
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size62.7 MiB
0
8141539
1
 
80017
ValueCountFrequency (%) 
0 8141539 99.0%
 
1 80017 1.0%
 

PAR_AC_4
Boolean

HIGH CORRELATION
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size62.7 MiB
0
8141545
1
 
80011
ValueCountFrequency (%) 
0 8141545 99.0%
 
1 80011 1.0%
 

PAR_SYS_1
Real number (ℝ≥0)

Distinct count713
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean56.11225194
Minimum20
Maximum149.25
Zeros0
Zeros (%)0.0%
Memory size62.7 MiB

Quantile statistics

Minimum20
5-th percentile42
Q154
median55.375
Q360
95-th percentile67
Maximum149.25
Range129.25
Interquartile range (IQR)6

Descriptive statistics

Standard deviation7.218618777
Coefficient of variation (CV)0.1286460359
Kurtosis3.226073251
Mean56.11225194
Median Absolute Deviation (MAD)4.840167217
Skewness-0.6656633492
Sum461330021.6
Variance52.10845705
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 20. 20.0625 20.4375 20.5625 20.6875 ... 115.6875 120.1875 125.125 135.8125 149.25 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
54.75 223297 2.7%
 
54.5 201833 2.5%
 
55 198114 2.4%
 
55.125 197735 2.4%
 
54.375 192950 2.3%
 
54.125 192572 2.3%
 
55.375 191559 2.3%
 
54.625 181887 2.2%
 
54.875 180888 2.2%
 
55.25 167669 2.0%
 
Other values (703) 6293052 76.5%
 
ValueCountFrequency (%) 
20 2845 < 0.1%
 
20.125 356 < 0.1%
 
20.25 354 < 0.1%
 
20.375 434 < 0.1%
 
20.5 578 < 0.1%
 
ValueCountFrequency (%) 
149.25 4 < 0.1%
 
146.375 4 < 0.1%
 
146.125 4 < 0.1%
 
145 4 < 0.1%
 
138.75 4 < 0.1%
 

PAR_SYS_2
Real number (ℝ≥0)

Distinct count238
Unique (%)< 0.1%
Missing2
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean311.8746598
Minimum122
Maximum382
Zeros0
Zeros (%)0.0%
Memory size62.7 MiB

Quantile statistics

Minimum122
5-th percentile255
Q1309
median320
Q3326
95-th percentile335
Maximum382
Range260
Interquartile range (IQR)17

Descriptive statistics

Standard deviation26.42852368
Coefficient of variation (CV)0.08474084973
Kurtosis6.545088119
Mean311.8746598
Median Absolute Deviation (MAD)17.87202457
Skewness-2.314777009
Sum2564094357
Variance698.466864
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
322 429958 5.2%
 
323 380790 4.6%
 
324 376991 4.6%
 
325 350757 4.3%
 
321 335126 4.1%
 
320 327262 4.0%
 
326 318893 3.9%
 
327 308289 3.7%
 
328 276292 3.4%
 
319 267449 3.3%
 
Other values (228) 4849747 59.0%
 
ValueCountFrequency (%) 
122 2 < 0.1%
 
138 48 < 0.1%
 
139 5 < 0.1%
 
140 64 < 0.1%
 
141 148 < 0.1%
 
ValueCountFrequency (%) 
382 4 < 0.1%
 
381 4 < 0.1%
 
372 12 < 0.1%
 
371 44 < 0.1%
 
370 64 < 0.1%
 

PAR_SYS_3
Real number (ℝ)

Distinct count709
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean55.67084327
Minimum-219.625
Maximum156.5
Zeros0
Zeros (%)0.0%
Memory size62.7 MiB

Quantile statistics

Minimum-219.625
5-th percentile41.75
Q154
median55.25
Q359.25
95-th percentile66.5
Maximum156.5
Range376.125
Interquartile range (IQR)5.25

Descriptive statistics

Standard deviation7.456876626
Coefficient of variation (CV)0.1339458177
Kurtosis4.581991927
Mean55.67084327
Median Absolute Deviation (MAD)4.827394936
Skewness-0.8733125774
Sum457700955.5
Variance55.60500902
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-219.625 -28.8125 -1.9375 20.0625 20.1875 ... 114.1875 126.4375 140. 153.5 156.5 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
54.75 222447 2.7%
 
54.5 211280 2.6%
 
55.125 202387 2.5%
 
54.125 202368 2.5%
 
55 200938 2.4%
 
54.375 196910 2.4%
 
55.375 194179 2.4%
 
54.875 183920 2.2%
 
54.625 182479 2.2%
 
54.25 176477 2.1%
 
Other values (699) 6248171 76.0%
 
ValueCountFrequency (%) 
-219.625 4 < 0.1%
 
-119.875 5 < 0.1%
 
-33.75 5 < 0.1%
 
-23.875 10 < 0.1%
 
20 2397 < 0.1%
 
ValueCountFrequency (%) 
156.5 4 < 0.1%
 
156 4 < 0.1%
 
155.75 4 < 0.1%
 
155.625 4 < 0.1%
 
155.25 8 < 0.1%
 

PAR_SYS_4
Real number (ℝ≥0)

Distinct count234
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean311.8745313
Minimum120
Maximum378
Zeros0
Zeros (%)0.0%
Memory size62.7 MiB

Quantile statistics

Minimum120
5-th percentile249
Q1310
median321
Q3326
95-th percentile335
Maximum378
Range258
Interquartile range (IQR)16

Descriptive statistics

Standard deviation27.6604775
Coefficient of variation (CV)0.08869104312
Kurtosis6.829035816
Mean311.8745313
Median Absolute Deviation (MAD)18.21603561
Skewness-2.452484021
Sum2564093924
Variance765.1020157
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[120. 130. 140.5 143.5 144.5 ... 365.5 366.5 368.5 370.5 378. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
322 486044 5.9%
 
323 460310 5.6%
 
324 444135 5.4%
 
325 394700 4.8%
 
321 381826 4.6%
 
320 365355 4.4%
 
326 337416 4.1%
 
327 312215 3.8%
 
319 280733 3.4%
 
328 257711 3.1%
 
Other values (224) 4501111 54.7%
 
ValueCountFrequency (%) 
120 1 < 0.1%
 
140 76 < 0.1%
 
141 146 < 0.1%
 
142 144 < 0.1%
 
143 174 < 0.1%
 
ValueCountFrequency (%) 
378 4 < 0.1%
 
371 20 < 0.1%
 
370 64 < 0.1%
 
369 92 < 0.1%
 
368 184 < 0.1%
 

WAR_SYS_1
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size62.7 MiB
0
8221425
1
 
131
ValueCountFrequency (%) 
0 8221425 > 99.9%
 
1 131 < 0.1%
 

COM_SYS_1
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size62.7 MiB
0
8220475
1
 
1081
ValueCountFrequency (%) 
0 8220475 > 99.9%
 
1 1081 < 0.1%
 

PAR_SYS_9
Real number (ℝ)

HIGH CORRELATION
ZEROS
Distinct count102
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.117964574
Minimum-6
Maximum7.5
Zeros4339222
Zeros (%)52.8%
Memory size62.7 MiB

Quantile statistics

Minimum-6
5-th percentile0
Q10
median0
Q33.6
95-th percentile6.45
Maximum7.5
Range13.5
Interquartile range (IQR)3.6

Descriptive statistics

Standard deviation2.409542962
Coefficient of variation (CV)1.137669153
Kurtosis-1.213266635
Mean2.117964574
Median Absolute Deviation (MAD)2.23598504
Skewness0.5373590427
Sum17412964.35
Variance5.805897286
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-6. -0.6 0.0375 0.4125 0.6375 ... 7.0125 7.0875 7.1625 7.4625 7.5 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 4339222 52.8%
 
3.45 519495 6.3%
 
3.375 475641 5.8%
 
3.525 449240 5.5%
 
3.6 272144 3.3%
 
6.525 235623 2.9%
 
6.15 206274 2.5%
 
3.75 192618 2.3%
 
3.675 192229 2.3%
 
6.075 180503 2.2%
 
Other values (92) 1158567 14.1%
 
ValueCountFrequency (%) 
-6 2 < 0.1%
 
-1.2 2 < 0.1%
 
0 4339222 52.8%
 
0.075 4 < 0.1%
 
0.15 2 < 0.1%
 
ValueCountFrequency (%) 
7.5 43 < 0.1%
 
7.425 26 < 0.1%
 
7.35 33 < 0.1%
 
7.275 40 < 0.1%
 
7.2 56 < 0.1%
 

PAR_SYS_10
Real number (ℝ)

HIGH CORRELATION
ZEROS
Distinct count106
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.16059904
Minimum-6.075
Maximum9.225
Zeros4330986
Zeros (%)52.7%
Memory size62.7 MiB

Quantile statistics

Minimum-6.075
5-th percentile0
Q10
median0
Q33.675
95-th percentile6.3
Maximum9.225
Range15.3
Interquartile range (IQR)3.675

Descriptive statistics

Standard deviation2.434841666
Coefficient of variation (CV)1.126928977
Kurtosis-1.328236337
Mean2.16059904
Median Absolute Deviation (MAD)2.276835756
Skewness0.4855266553
Sum17763486
Variance5.928453937
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-6.075 -0.1875 0.0375 0.4125 0.6375 ... 7.0875 7.2375 7.3875 8.3625 9.225 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 4330986 52.7%
 
3.6 621594 7.6%
 
3.675 480267 5.8%
 
3.525 458682 5.6%
 
3.45 341068 4.1%
 
6.15 238980 2.9%
 
6.525 231203 2.8%
 
6.075 208359 2.5%
 
6 154957 1.9%
 
6.225 130338 1.6%
 
Other values (96) 1025122 12.5%
 
ValueCountFrequency (%) 
-6.075 2 < 0.1%
 
-5.175 2 < 0.1%
 
-3.975 2 < 0.1%
 
-0.375 2 < 0.1%
 
0 4330986 52.7%
 
ValueCountFrequency (%) 
9.225 2 < 0.1%
 
7.5 30 < 0.1%
 
7.425 3 < 0.1%
 
7.35 15 < 0.1%
 
7.275 17 < 0.1%
 

PAR_SYS_5
Boolean

HIGH CORRELATION
Distinct count2
Unique (%)< 0.1%
Missing2
Missing (%)< 0.1%
Memory size62.7 MiB
1
7728331
0
 
493223
(Missing)
 
2
ValueCountFrequency (%) 
1 7728331 94.0%
 
0 493223 6.0%
 
(Missing) 2 < 0.1%
 

WAR_SYS_2
Boolean

CONSTANT
REJECTED
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size62.7 MiB
0
8221556
ValueCountFrequency (%) 
0 8221556 100.0%
 

PAR_SYS_6
Boolean

HIGH CORRELATION
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size62.7 MiB
1
7714417
0
 
507139
ValueCountFrequency (%) 
1 7714417 93.8%
 
0 507139 6.2%
 

WAR_SYS_3
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size62.7 MiB
0
8221553
1
 
3
ValueCountFrequency (%) 
0 8221553 > 99.9%
 
1 3 < 0.1%
 

PAR_SYS_7
Real number (ℝ≥0)

ZEROS
Distinct count1016
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean109.8274322
Minimum0
Maximum537.3
Zeros259639
Zeros (%)3.2%
Memory size62.7 MiB

Quantile statistics

Minimum0
5-th percentile79.3125
Q192.475
median100.2375
Q3136.35
95-th percentile156.9375
Maximum537.3
Range537.3
Interquartile range (IQR)43.875

Descriptive statistics

Standard deviation33.32104555
Coefficient of variation (CV)0.3033945607
Kurtosis2.833154087
Mean109.8274322
Median Absolute Deviation (MAD)25.15502502
Skewness-0.6126988129
Sum902952384.3
Variance1110.292077
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 1.18125 2.53125 2.86875 3.20625 ... 339.01875 342.39375 358.59375 390.65625 537.3 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 259639 3.2%
 
93.15 131180 1.6%
 
93.4875 129894 1.6%
 
92.8125 128664 1.6%
 
93.825 128552 1.6%
 
92.475 127406 1.5%
 
94.1625 125224 1.5%
 
92.1375 125123 1.5%
 
91.8 121827 1.5%
 
94.5 121147 1.5%
 
Other values (1006) 6822900 83.0%
 
ValueCountFrequency (%) 
0 259639 3.2%
 
2.3625 55 < 0.1%
 
2.7 95 < 0.1%
 
3.0375 343 < 0.1%
 
3.375 191 < 0.1%
 
ValueCountFrequency (%) 
537.3 4 < 0.1%
 
392.5125 4 < 0.1%
 
388.8 4 < 0.1%
 
387.7875 4 < 0.1%
 
384.75 4 < 0.1%
 

PAR_SYS_8
Real number (ℝ≥0)

ZEROS
Distinct count944
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean109.7345003
Minimum0
Maximum393.525
Zeros159703
Zeros (%)1.9%
Memory size62.7 MiB

Quantile statistics

Minimum0
5-th percentile81.675
Q192.475
median100.2375
Q3133.65
95-th percentile156.9375
Maximum393.525
Range393.525
Interquartile range (IQR)41.175

Descriptive statistics

Standard deviation30.8554218
Coefficient of variation (CV)0.281182506
Kurtosis2.682709994
Mean109.7345003
Median Absolute Deviation (MAD)23.59240749
Skewness-0.3911270892
Sum902188339.6
Variance952.0570545
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 1.51875 3.20625 3.88125 4.55625 ... 328.21875 330.91875 367.70625 386.775 393.525 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 159703 1.9%
 
92.475 132622 1.6%
 
92.1375 132620 1.6%
 
92.8125 131502 1.6%
 
93.15 131134 1.6%
 
91.8 129090 1.6%
 
93.4875 128164 1.6%
 
91.4625 127630 1.6%
 
93.825 124729 1.5%
 
91.125 121613 1.5%
 
Other values (934) 6902749 84.0%
 
ValueCountFrequency (%) 
0 159703 1.9%
 
3.0375 284 < 0.1%
 
3.375 640 < 0.1%
 
3.7125 594 < 0.1%
 
4.05 811 < 0.1%
 
ValueCountFrequency (%) 
393.525 4 < 0.1%
 
392.5125 4 < 0.1%
 
388.125 8 < 0.1%
 
385.425 4 < 0.1%
 
373.95 4 < 0.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

ACFLIGHTTIMEAMBIENT_1PAR_AC_1PAR_AC_2PAR_AC_3PAR_AC_4PAR_SYS_1PAR_SYS_2PAR_SYS_3PAR_SYS_4WAR_SYS_1COM_SYS_1PAR_SYS_9PAR_SYS_10PAR_SYS_5WAR_SYS_2PAR_SYS_6WAR_SYS_3PAR_SYS_7PAR_SYS_8
0AC19006042011720.044.1825040.928750069.000285.063.625285000.00.01.0010225.7875132.300
1AC19006042111720.042.9837540.072500069.000287.063.625285000.00.01.0010225.7875132.300
2AC19006042211720.542.9837540.072500069.000287.063.625285000.00.01.0010112.7250132.300
3AC19006042311720.542.9837540.072500067.500287.063.625285000.00.01.0010112.7250124.875
4AC19006042306011724.00.000000.000000066.625246.059.875228000.00.00.000079.65000.000
5AC19006042306111724.00.000000.000000066.625246.059.875228000.00.00.000079.65000.000
6AC19006042306211724.50.000000.000000066.625246.059.875249000.00.00.00000.00000.000
7AC19006042306311724.50.000000.000000066.250246.059.875249000.00.00.00000.00000.000
8AC19006042306411725.00.000000.000000066.250246.060.375249000.00.00.00000.00000.000
9AC19006042306511724.50.000000.000000066.250246.060.375249000.00.00.00000.00000.000

Last rows

ACFLIGHTTIMEAMBIENT_1PAR_AC_1PAR_AC_2PAR_AC_3PAR_AC_4PAR_SYS_1PAR_SYS_2PAR_SYS_3PAR_SYS_4WAR_SYS_1COM_SYS_1PAR_SYS_9PAR_SYS_10PAR_SYS_5WAR_SYS_2PAR_SYS_6WAR_SYS_3PAR_SYS_7PAR_SYS_8
8221546AC36ffa91e506812192.50.00.00066.625287.062.375254000.00.00.0000130.6125151.2000
8221547AC36ffa91e506912192.50.00.00066.625287.064.625254000.00.00.0000130.6125151.2000
8221548AC36ffa91e507012192.50.00.00066.625289.064.625254000.00.00.0000130.6125151.2000
8221549AC36ffa91e507112192.50.00.00066.625289.064.625256000.00.00.0000137.0250151.2000
8221550AC36ffa91e507212192.50.00.00067.500289.064.625256000.00.00.0000137.025098.8875
8221551AC36ffa91e507312192.50.00.00067.500289.064.625256000.00.00.0000137.025098.8875
8221552AC36ffa91e507412192.50.00.00067.500290.064.625256000.00.00.0000137.025098.8875
8221553AC36ffa91e507512192.50.00.00067.500290.064.625257000.00.00.0000134.662598.8875
8221554AC36ffa91e507612192.50.00.00066.000290.064.625257000.00.00.0000134.6625112.7250
8221555AC36ffa91e507712192.50.00.00066.000290.062.375257000.00.00.0000134.6625112.7250